Real-Time Pose Estimation for Event Cameras with Stacked Spatial LSTM Networks
Abstract
We present a new method to estimate the 6DOF pose of an event camera based solely on the event stream. Our method first creates an event image from the list of events that occur in a very short time interval; a Stacked Spatial LSTM Network (SP-LSTM) then learns and estimates the camera pose. The SP-LSTM comprises a CNN that learns deep features from the event images and a stack of LSTMs that learns spatial dependencies in the image feature space. We show that spatial dependency plays an important role in the pose estimation task and that the SP-LSTM can learn this information effectively. Experimental results on the public dataset show that our approach outperforms recent methods by a substantial margin. Overall, the proposed method reduces the position error by about 6 times and the orientation error by about 3 times relative to the state of the art. The source code and trained models will be released.
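The first step described in the abstract, building an event image from the events in a short time window, might look roughly like the sketch below. The abstract does not specify the encoding, so everything here is an assumption made for illustration: the function name `events_to_image`, the `(t, x, y, polarity)` event layout, and the pixel convention (1.0 for a positive event, 0.0 for a negative event, 0.5 where no event occurred). It is not the authors' implementation.

```python
import numpy as np

def events_to_image(events, height, width, t_start, t_end):
    """Accumulate events with t_start <= t < t_end into one grayscale image.

    events: sequence of (t, x, y, polarity) rows, polarity +1 or -1,
    ordered by time. This layout is assumed for illustration only.
    """
    events = np.asarray(events, dtype=np.float64).reshape(-1, 4)
    # Assumed convention: 0.5 marks pixels where no event occurred.
    image = np.full((height, width), 0.5, dtype=np.float32)
    t = events[:, 0]
    window = events[(t >= t_start) & (t < t_end)]
    # Explicit loop so that a later event at a pixel overwrites an earlier one.
    for _, x, y, polarity in window:
        image[int(y), int(x)] = 1.0 if polarity > 0 else 0.0
    return image

# Hypothetical usage: four events on a 2x3 sensor, 10 ms window.
sample_events = [
    (0.000, 1, 0, +1),
    (0.001, 2, 1, -1),
    (0.002, 1, 0, -1),  # same pixel as the first event, arrives later
    (0.050, 0, 0, +1),  # falls outside the window
]
sample_image = events_to_image(sample_events, height=2, width=3,
                               t_start=0.0, t_end=0.01)
```

An image of this kind would then be the input to the CNN, whose features the stacked LSTMs process; that part is omitted here because the abstract gives no architectural details.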
Similar resources
Deep Stacked Bidirectional and Unidirectional LSTM Recurrent Neural Network for Network-wide Traffic Speed Prediction
Short-term traffic forecasting based on deep learning methods, especially long short-term memory (LSTM) neural networks, has received much attention in recent years. However, the potential of deep learning methods is far from being fully exploited in terms of the depth of the architecture, the spatial scale of the prediction area, and the prediction power of spatial-temporal data. In this paper, a ...
Neuromorphic Event-Based 3D Pose Estimation
Pose estimation is a fundamental step in many artificial vision tasks. It consists of estimating the 3D pose of an object with respect to a camera from the object's 2D projection. Current state of the art implementations operate on images. These implementations are computationally expensive, especially for real-time applications. Scenes with fast dynamics exceeding 30-60 Hz can rarely be proces...
Extended Kalman Filter Based Pose Estimation Using Multiple Cameras
In this work, we solve the pose estimation problem for robot motion by placing multiple cameras on the robot. In particular, we combine the Extended Kalman Filter (EKF) with the multiple cameras. An essential strength of our approach is that it does not require finding image feature correspondences among cameras which is a difficult classical problem. The initial pose, the tracked features, and...
Stacked Hourglass Networks for Human Pose Estimation
This work introduces a novel Convolutional Network architecture for the task of human pose estimation. Features are processed across all scales and consolidated to best capture the various spatial relationships associated with the body. We show how repeated bottom-up, top-down processing used in conjunction with intermediate supervision is critical to improving the performance of the network. W...
Continuous-Time Visual-Inertial Trajectory Estimation with Event Cameras
In contrast to traditional cameras, which output images at a fixed rate, event cameras have independent pixels that output asynchronous pixel-level brightness changes with microsecond resolution. In this paper, we leverage a continuous-time framework to perform trajectory estimation by fusing visual data from a moving event camera with inertial data from an IMU. This framework allows direct inte...
Journal:
- CoRR
Volume: abs/1708.09011
Publication date: 2017